The Protein Information Resource: an integrated public resource of functional annotation of proteins

Authors

Cathy H. Wu

Hongzhan Huang

Leslie Arminski

Jorge Castro-Alvear

Yongxing Chen

Zhang-Zhi Hu

Robert S. Ledley

Kali C. Lewis

Hans-Werner Mewes

Bruce C. Orcutt

Baris E. Suzek

Akira Tsugita

C. R. Vinayaka

Lai-Su L. Yeh

Jian Zhang

Winona C. Barker

Abstract

The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation of protein data to support genomic/proteomic research and scientific discovery. The PIR, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the PIR-International Protein Sequence Database (PSD), the major annotated protein sequence database in the public domain, containing about 250 000 proteins. To improve protein annotation and the coverage of experimentally validated data, a bibliography submission system has been developed for scientists to submit, categorize and retrieve literature information. Comprehensive protein information is available from iProClass, which includes family classification at the superfamily, domain and motif levels, structural and functional features of proteins, and cross-references to over 40 biological databases. To provide timely and comprehensive protein data with source attribution, we have introduced a non-redundant reference protein database, PIR-NREF. The database consists of about 800 000 proteins collected from PIR-PSD, SWISS-PROT, TrEMBL, GenPept, RefSeq and PDB, with composite protein names and literature data. To promote database interoperability, we provide XML data distribution and an open database schema, and adopt common ontologies. The PIR web site (http://pir.georgetown.edu/) features data mining and sequence analysis tools for information retrieval and functional identification of proteins based on both sequence and annotation information. The PIR databases and other files are also available by FTP (ftp://nbrfa.georgetown.edu/pir_databases).
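The idea of a non-redundant reference database with source attribution, as described for PIR-NREF, can be illustrated with a minimal sketch. This is not the PIR pipeline: the abstract does not state the merging criterion, so the sketch assumes that two records are redundant when they share the same organism and an identical sequence. The record fields, the `NrefEntry` class, the `build_nonredundant` function and all accessions below are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class NrefEntry:
    """One non-redundant entry: a unique sequence plus where it was seen."""
    sequence: str
    names: list = field(default_factory=list)    # composite protein names
    sources: list = field(default_factory=list)  # (database, accession) pairs


def build_nonredundant(records):
    """Merge records that share the same organism and sequence.

    ASSUMPTION: `records` is an iterable of dicts with the hypothetical keys
    'db', 'accession', 'name', 'organism' and 'sequence'. The real PIR-NREF
    criteria are not given in the abstract.
    """
    merged = {}
    for rec in records:
        key = (rec["organism"], rec["sequence"].upper())
        entry = merged.setdefault(key, NrefEntry(sequence=key[1]))
        # Keep every distinct name so the entry carries a composite name list.
        if rec["name"] not in entry.names:
            entry.names.append(rec["name"])
        # Keep every source so attribution is never lost on merging.
        entry.sources.append((rec["db"], rec["accession"]))
    return merged


# Made-up records for illustration only; accessions and sequences are not real.
records = [
    {"db": "PIR-PSD", "accession": "FAKE001", "name": "kinase A",
     "organism": "E. coli", "sequence": "MKTAYIAK"},
    {"db": "SWISS-PROT", "accession": "FAKE002", "name": "protein kinase A",
     "organism": "E. coli", "sequence": "mktayiak"},
    {"db": "TrEMBL", "accession": "FAKE003", "name": "hypothetical protein",
     "organism": "E. coli", "sequence": "MSLNQQ"},
]

nref = build_nonredundant(records)
for (organism, seq), entry in nref.items():
    print(organism, seq, entry.names, entry.sources)
```

Keying on the pair (organism, sequence) keeps identical sequences from different organisms apart. A real system would also need to decide how to treat fragments and near-identical sequences, which this sketch ignores.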

Similar resources

Protein family classification and functional annotation

With the accelerated accumulation of genomic sequence data, there is a pressing need to develop computational methods and advanced bioinformatics infrastructure for reliable and large-scale protein annotation and biological knowledge discovery. The Protein Information Resource (PIR) provides an integrated public resource of protein informatics to support genomic and proteomic research. PIR prod...

A Family Classification Approach to Functional Annotation of Proteins

The high-throughput genome projects have resulted in a rapid accumulation of genome sequences for a large number of organisms. To fully realize the value of the data, scientists need to identify proteins encoded by these genomes and understand how these proteins function in making up a living cell. With experimentally verified information on protein function lagging far behind, computational me...

Measuring the effectiveness of human resource information systems in National Iranian Oil Company: an empirical assessment

While the growth of MIS investment and its influence is making MIS evaluation ever more indispensable, little attention has been paid to assessing and communicating system effectiveness. This paper attempts to empirically assess the effectiveness of integrated human resource information system in Iranian oil industry. As suggested by recent research, the widely accepted IS success model is...

SInCRe—structural interactome computational resource for Mycobacterium tuberculosis

We have developed an integrated database for Mycobacterium tuberculosis H37Rv (Mtb) that collates information on protein sequences, domain assignments, functional annotation and 3D structural information along with protein-protein and protein-small molecule interactions. SInCRe (Structural Interactome Computational Resource) is developed out of CamBan (Cambridge and Bangalore) collaboration. Th...

Functional Annotation of Two Hypothetical Proteins Reveals Valuable Proteins Involved in Response to Salinity: An in silico Approach

With the exponential growth in the determination of protein sequences and structures by genome sequencing and structural genomics approaches, there is a growing demand for valid bioinformatics methods to define the function of these proteins. In this study, our objective is to identify the function of unknown proteins from UCB-1 pistachio rootstock and specify their class...

Journal:

Nucleic Acids Research

Volume 30, Issue 1

Publication date: 2002
